NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Formally Verifying Deep Reinforcement Learning Controllers with Lyapunov Barrier Certificates

Mandal, Udayan; Amir, Guy; Wu, Haoze; Daukantas, Ieva; Newell, Fletcher Lee; Ravaioli, Umberto J; Meng, Baoluo; Durling, Michael; Ganai, Milan; Shim, Tobey; et al (October 2024, TU Wien Academic Press)
Narodytska, Nina; Ruemmer, Philipp (Ed.)
Deep reinforcement learning (DRL) is a powerful machine learning paradigm for generating agents that control autonomous systems. However, the “black box” nature of DRL agents limits their deployment in real-world safety-critical applications. A promising approach for providing strong guarantees on an agent's behavior is to use Neural Lyapunov Barrier (NLB) certifcates, which are learned functions over the system whose properties indirectly imply that an agent behaves as desired. However, NLB-based certifcates are typically diffcult to learn and even more diffcult to verify, especially for complex systems. In this work, we present a novel method for training and verifying NLB-based certifcates for discrete-time systems. Specifcally, we introduce a technique for certifcate composition, which simplifes the verifcation of highly-complex systems by strategically designing a sequence of certifcates. When jointly verifed with neural network verifcation engines, these certifcates provide a formal guarantee that a DRL agent both achieves its goals and avoids unsafe behavior. Furthermore, we introduce a technique for certifcate fltering, which signifcantly simplifes the process of producing formally verifed certifcates. We demonstrate the merits of our approach with a case study on providing safety and liveness guarantees for a DRL-controlled spacecraft.
more » « less
Full Text Available
Safe and Reliable Training of Learning-Based Aerospace Controllers

https://doi.org/10.1109/DASC62030.2024.10749499

Mandal, Udayan; Amir, Guy; Wu, Haoze; Daukantas, Ieva; Newell, Fletcher Lee; Ravaioli, Umberto; Meng, Baoluo; Durling, Michael; Hobbs, Kerianne; Ganai, Milan; et al (September 2024, IEEE)

In recent years, deep reinforcement learning (DRL) approaches have generated highly successful controllers for a myriad of complex domains. However, the opaque nature of these models limits their applicability in aerospace systems and sasfety-critical domains, in which a single mistake can have dire consequences. In this paper, we present novel advancements in both the training and verification of DRL controllers, which can help ensure their safe behavior. We showcase a design-for-verification approach utilizing k-induction and demonstrate its use in verifying liveness properties. In addition, we also give a brief overview of neural Lyapunov Barrier certificates and summarize their capabilities on a case study. Finally, we describe several other novel reachability-based approaches which, despite failing to provide guarantees of interest, could be effective for verification of other DRL systems, and could be of further interest to the community.
more » « less
Full Text Available
Formal Verification Tool Evaluation For Unmanned Aircraft Containing Complex Functions

https://doi.org/10.1109/DASC50938.2020.9256529

Herencia-Zapana, Heber; Lopez, James; Gallagher, Glen; Meng, Baoluo; Patterson, Cameron; Maalolan, Lakshman (October 2020, 2020 AIAA/IEEE 39th Digital Avionics Systems Conference (DASC))
null (Ed.)
Full Text Available
Towards Verification of Neural Networks for Small Unmanned Aircraft Collision Avoidance

https://doi.org/10.1109/DASC50938.2020.9256616

Irfan, Ahmed; Julian, Kyle D.; Wu, Haoze; Barrett, Clark; Kochenderfer, Mykel J.; Meng, Baoluo; Lopez, James (October 2020, Proceedings of the 39th Digital Avionics Systems Conference (DASC '20))

The ACAS X family of aircraft collision avoidance systems uses large numeric lookup tables to make decisions. Recent work used a deep neural network to approximate and compress a collision avoidance table, and simulations showed that the neural network performance was comparable to the original table. Consequently, neural network representations are being explored for use on small aircraft with limited storage capacity. However, the black-box nature of deep neural networks raises safety concerns because simulation results are not exhaustive. This work takes steps towards addressing these concerns by applying formal methods to analyze the behavior of collision avoidance neural networks both in isolation and in a closed-loop system. We evaluate our approach on a specific set of collision avoidance networks and show that even though the networks are not always locally robust, their closed-loop behavior ensures that they will not reach an unsafe (collision) state.
more » « less
Full Text Available

Search for: All records